Hangjun Ye

OneVLA: A Unified Framework for Embodied Tasks

May 31, 2026
Via arXiv

LVDrive: Latent Visual Representation Enhanced Vision-Language-Action Autonomous Driving Model

May 21, 2026
Via arXiv

Beyond Imitation: Learning Safe End-to-End Autonomous Driving from Hard Negatives

May 19, 2026
Via arXiv

RotVLA: Rotational Latent Action for Vision-Language-Action Model

May 13, 2026
Via arXiv

PointForward: Feedforward Driving Reconstruction through Point-Aligned Representations

May 12, 2026
Via arXiv

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Apr 20, 2026
Via arXiv

XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments

Apr 20, 2026
Via arXiv

DriveVA: Video Action Models are Zero-Shot Drivers

Apr 05, 2026
Via arXiv

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving

Apr 02, 2026
Via arXiv

Toward Physically Consistent Driving Video World Models under Challenging Trajectories

Mar 25, 2026
Via arXiv